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Abstract 

Background: Shigella flexneri is the major cause of shigellosis in the developing countries. The 0-antigen 
connponent of the lipopolysaccharide is one of the key virulence deternninants required for the pathogenesis of 5. 
flexneri. The glucosyltransferase and/or acetyltransferase genes responsible for the modification of the 0-antigen are 
encoded by temperate serotype converting bacteriophage present in the 5. flexneri genome. Several serotype 
converting phages have previously been isolated and characterized, however, attempts to isolate a serotype 
converting phage which encodes the modification genes of serotypes 4a strain have not been successful. 

Results: In this study, a novel temperate serotype converting bacteriophage SflV was isolated. Lysogenisation of 
phage SflV converted serotype Y strain to serotype 4a. Electron microscopy indicated that SflV belongs to 
Myoviridae family. The 39,758 bp genome of phage SflV encompasses 54 open reading frames (orfs). Protein level 
comparison of SflV with other serotype converting phages of 5. flexneri revealed that SflV is similar to phage Sfll and 
SfV. The comparative analysis also revealed that SflV phage contained five proteins which were not found in any 
other phages of 5. flexneri. These proteins were: a tail fiber assembly protein, two hypothetical proteins with no 
clear function, and two other unknown proteins which were encoded by orfs present on a moron, that presumably 
got introduced in SflV genome from another species via a transposon. These unique proteins of SflV may play a 
role in the pathogenesis of the host. 

Conclusions: This study reports the isolation and complete genome sequence analysis of bacteriophage SflV. The 
SflV phage has a host range significantly different from the other phages of Shigella. Comparative genome analysis 
identified several proteins unique to SflV, which may potentially be involved in the survival and pathogenesis of its 
host. These findings will further our understanding on the evolution of these phages, and will also facilitate studies 
on development of new phage vectors and therapeutic agents to control infections caused by 5. flexneri. 
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Background 

Shigella diarrhoeal illness continues to be a leading 
cause of morbidity and mortality worldwide, particularly 
in the developing countries. Annually there are esti- 
mated 7 million cases of shigellosis globally, of which 1.1 
million result in death. Shigella encompasses four sub- 
groups {S. boydii, S, dysenteriae, S, flexneri, and S, sonnei) 
[1]. Although S, dysenteriae type 1 is associated with the 
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most severe form of the disease and high mortality rate 
when epidemics occur, most of the deaths are attributed 
to the endemic form of the disease, which is most often 
caused by S, flexneri [2]. 

Based on the O-antigen structure of the lipopolysac- 
charide (LPS) molecule, S, flexneri is currently divided 
into 17 serotypes [3-5]. All the serotypes, with an excep- 
tion of serotype 6, share a common O-antigen backbone 
comprising of repeating tetrasaccharide units (A^- 
acetylglucosamine attached to three rhamnose residues) 
[6]. Modifications to the basic O-antigen backbone, 
which occur through glucosylation and/or 0-acetylation 
of one or more sugars within the repeating unit, form 
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the basis of serotype conversion. 0-acetylation is known 
to occur only on rhamnose III residue, while glu- 
cosylation can occur on any of the three rhamnoses or 
on the A^-acetylglucosamine. The genes which mediate 
the addition of glucosyl or 0-acetyl residues to the spe- 
cific sugars are encoded by temperate bacteriophage. 
These phage are integrated into the host chromosome, 
forming a prophage that stably expresses serotype con- 
version genes [7]. 

The glucosylation loci in these phages contain three 
genes [gtrA, gtrB, and ^^r(type)] that are arranged in a 
single operon known as a gtr cluster. The first two 
genes, gtrA and gtrB, which encode proteins involved in 
transferring the glucosyl group from cytoplasm into the 
periplasm, are highly conserved and interchangeable 
between different serotypes. Whereas, the third gene, 
^^r(type) is unique to each serotype and encodes a 
glucosyltransferase that is responsible for attaching the 
glucosyl group to a specific sugar unit of the O-antigen 
via a specific linkage [8-14]. In contrast, the addition of 
an 0-acetyl group is mediated by only one protein, an 
0-acetyltransferase encoded by the oac gene of phage 
Sf6 [15]. 

Five serotype converting bacteriophages: Sfl, Sfll, SfV, 
SfX and Sf6, have previously been isolated and studied 
[12,15-19]. Lysogenisation of these bacteriophages have 
been shown to convert serotype Y strains to serotype la, 
2a, 5a, X and 3b, respectively, and the gtrl, gtrll, gtrV, 
gtrXy and oac genes, which are involved in the serotype 
conversion have been characterized. Analysis of the 
phage genomes has also revealed that in all these phages, 
the genes involved in serotype conversion are located 
adjacent to the int attP region, and the phage DNA (ex- 
cept Sf6) has integrated into the thrW gene, in the pro- 
lac region of the host genome [7]. 

The type IV O-antigen modification genes from S, 
flexneri 4a strains have been characterized and shown to 
convert serotype Y strain to serotype 4a. Also, analysis of 
the fragment of DNA upstream of the gtrA gene in sero- 
type 4a strain has revealed the presence of bacteriophage 
int gene and attP site [8]. However, attempts to isolate a 
serotype converting SflV bacteriophage have not been 
successful to date. 

In this study, we report the isolation and char- 
acterization of bacteriophage SflV, induced from a 
wildtype S, flexneri 4av strain. The complete sequence 
of phage SflV was determined and compared to other 
known Shigella serotype converting phages at the pro- 
tein level. Our results suggest that the SflV phage was 
similar to Sfll and SfV phages, and there was a consider- 
able conservation in the genomic architecture of all the 
phages. Five proteins identified were unique to SflV, 
four of which could potentially play a role in pathogen- 
esis of its host. 



Methods 

Bacterial strains, phage and media 

Bacteriophage SflV was isolated from S, flexneri 4av 
strain (SFL1522, International Centre for Diarrhoeal 
Disease Research, Bangladesh) using the UV irradiation 
protocol described by Adams et al. [8]. Plaque assays 
were performed on the lysate after induction. Bacterio- 
phage stocks were prepared by picking a single plaque, 
propagating on serotype Y strain (SFL 124) [20], and pre- 
cipitating phage using polyethylene glycol, as described 
in Sambrook et al. [21]. 

Phage was routinely propagated in NZCYM broth and 
S. flexneri strains were grown in Luria-Bertani (LB) broth 
or LB agar. 

Electron microscopy 

For electron microscopy, phage preparations were nega- 
tively stained with 2% phosphotungstic acid (pH 7.0) 
and electron micrographs were taken with a Hitachi 
H600 transmission electron microscope. 

Determination of phage host range and lysogens 

To determine the host range of bacteriophage SflV, dif- 
ferent dilutions of the purified phage stock were made in 
SM buffer (100 mM NaCl, 50 mM Tris-HCl pH 7.5, 
8 mM MgS04 and 0.002% (w/v) gelatin). 100 \A of over- 
night culture of the required bacterial host strain was 
then spread on the LB agar plate. Once the plates were 
dry, 5-10 \A of the phage stock of different dilutions was 
spotted on the plate. The plates were incubated over- 
night at 37°C and were examined for appearance of the 
clear zones around the phage drop. 

Lysogens of SflV were obtained using the method pre- 
viously described by Mar vis et al. [12]. 

Genomic DNA purification and sequencing 

Bacterial genomic DNA was isolated using GE Healthcare 
genomic DNA isolation kit (GE Healthcare), according to 
the manufacturer s instructions. Complete genome of the 
host strain SFL 1522 was sequenced using 250 bp paired- 
end, Miseq, lUumina sequencing (performed at the 
Ramaciotti Centre, University of New South Wales). Reads 
generated were then assembled into contigs using CLC 
Genomics Workbench (Ver 5.5.1, CLC Bio). The contigs 
containing the gtrA and the int gene were extracted. The 
gaps between the desired contigs were filled by designing 
primers binding to the end of each contig and amplifying 
the segment using SFL1522 genomic DNA. PCR product 
obtained was further purified using the gel purification kit 
(Promega) according to the manufacturers instructions. 
Purified PCR product was then sequenced using the ABI 
Prism BigDye Terminator Cycle Sequencing Ready reac- 
tion kit, and the reactions were performed in a GeneAmp 
2400 thermal cycler (Perkin Elmer) according to the 
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manufacturers protocol Reactions were run on an ABI 
Prism 377 Automated sequencer at the Biomedical Re- 
sources Facility, John Curtin School of Medical Research, 
Australian National University. 

Sequence assembly and analysis 

The contigs were assembled using CLC Genomics Work- 
bench (Ver 5.5.1, CLC Bio) and the open reading frames 
(orfs) were determined using the NCBI ORF finder pro- 
gram and CLC Main Workbench (Ver 6.6.2, CLC Bio). 
The genome was scanned for tRNA using tRNAscan-SE 
Search server [22], and the Rho-independent terminators 
were identified using ARLOND terminator finding pro- 
gram [23]. 

Genome comparison 

Multiple genome alignment was performed using Mauve 
[24], and the protein level alignments were perfomed 
using ClustalW [25]. The accession numbers for the 
phages used for comparative genomics were: Entero- 
bacteria phage Sfl (JX509734), Shigella phage Sfll 
(NC_021857), Enterobacteria phage SfV (NC_003444), 
phage SfX (NC_017328), Shigella phage Sf6 (NC_005344), 
Enterobacteria phage cdtl (NC_009514), Enterobacteria 
phage lambda (NC_001416), Enterobacteria phage phiP27 
(NC_003356), Enterobacteria phage Mu (NC_000929), 
Stxl phage (AP005153), Salmonella phage epsilon34 
(NC_011976), Salmonella phage ST64B (NC_004313), 
Salmonella phage ST64T (NC_004348), Enterobacteria 
phage ST104 (NC_005841), and Salmonella phage SEl 
(DQ003260). 

Availability of supporting data 

The nucleotide sequence of SflV phage reported in this 
article has been deposited in the GenBank database as 
accession number KC8 14930. 

Results and discussion 

Isolation of bacteriophage SflV 

SflV phage was induced from serotype 4av, strain 
SFL 1522. Clear plaques were produced when the lysate 
after induction from SFL 1522 was plated onto the indi- 
cator strain SFL 124 (serotype Y). The colonies isolated 
from turbid areas of the plaques were assessed by slide 
agglutination, PCR and SflV infection. Colonies showing 
positive agglutination with both: group 3,4 (specific for 
serotype Y) and type IV antisera (specific for serotype 
IV), were assumed to be lysogens. Results of PCR ampli- 
fication and SflV infection indicated that the lysogens 
contained the SflV-specific gtrlVgervQ and were resistant 
to SflV infection. 



Morphology of SflV 

Electron microscopy was used to determine bacterio- 
phage SflV morphology. Electron micrographs revealed 
that the SflV phage particle has an isometric head of ca. 
55 nm and a long contractile tail of ca. 101 nm (Figure 1). 
It is a typical group A phage, of the family Myoviridae, 
and order Caudovirales, according to the morphological 
classification of Bradley [26]. Appearance of SflV also re- 
sembles other serotype converting phages of Shigella, like 
SfV [17], Sfll [12] and a newly isolated phage Sfl [19]. 

Host range of phage SflV 

The host range of the SflV bacteriophage was deter- 
mined with twelve serotypes (la, lb, Ic, 2a, 2b, 3a, 3b, 
4a, 4b, 5a, X, and Y) of S. flexneri. Of the serotypes ex- 
amined, SflV was capable of infecting serotypes: la, lb, 
Ic, X and Y. This suggests that, in comparison to other 
morphologically similar Shigella phages, SflV has a much 
broader host range than Sfl (infects only X and Y sero- 
types) [19], and Sfll (infects serotypes 3a, 5a and Y), but 




Figure 1 Electron micrograph of bacteriophage SflV. Electron 
micrograph of phage SflV, stained with 2% phosphotungstic acid. 
Bar 100 nm. 
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not SfV which infects seven serotypes (la, lb, 2a, 2b, 3b, 
4b, and Y) [27]. Surprisingly, there is very little similarity 
between the host range of these bacteriophages, with sero- 
type Y being the only common serotype infected by all the 
four phages. 

Determination of the phage SflV genomic sequence 

The complete DNA sequence of bacteriophage SflV was 
determined using a combination of Illumina (Miseq) and 
PGR sequencing. High throughput sequencing of the 
host (SFL1522) DNA, generated 4,32,905 pairs of reads, 
which is about 25 fold genome coverage. The genomic 
sequence was then de-novo assembled into 700 contigs 
and scaffolds using the CLC Genomics Workbench (Ver 
5.5.1, CLC Bio). The accuracy of the CLC assembly was 
further verified using another assembly tool. Velvet [28], 
and by mapping the reads back to the contigs. 

SflV phage genome was distributed between two contigs 
of 27 kb and 14 kb. The gap between the two contigs was 
then closed by targeted PCR and subsequent sequencing, 
yielding a contiguous prophage SflV genome flanked by 
the int and gtrA gene. 46 bp attp site (identical to phage 
Sfl and SfV) was identified on either ends of the prophage 
DNA, and was used to construct a circular SflV bacterio- 
phage genome. The linear orientation of the sequence 
starting from small terminase gene and ending with the 
cos site was then used for fiirther analysis. 

The genome of SflV consists of 39,758 bp with an 
average GC content of 50.29%, which is very similar to 
its host (50.90%). The bioinformatics analysis revealed a 
total of 54 orfs, taking up to 90.05% of the coding cap- 
acity of this virus. A plausible Shine-Dalgarno sequence 
and a start codon (44 with ATG, 9 with GTG and 1 with 
TTG) were identified for all the orfs. Thirty eight puta- 
tive SflV genes are transcribed rightwards (on the gen- 
etic map), while sixteen genes transcribed leftwards 
(Figure 2). The protein corresponding to each orf was 



then compared with the non-redundant protein databases 
using the NCBI Blast? tool. The database search yielded 
matches with other proteins with identity values between 
33% to 100% (Additional file 1: Table SI). The genome of 
bacteriophage SflV was also scanned for the presence of 
tRNAs and Rho-independent termiantors. The predicted 
Rho-independent terminators are summarized in Figure 2, 
however, no tRNA genes were identified. 

Comparison of SflV with other serotype converting 
phages of S. flexneri 

The genome of bacteriophage SflV was first compared 
to 15 previously sequenced phages from various hosts: S. 
flexneri (5), E, coli (5), and Salmonella (5), using Mauve 
(Additional file 2: Figure SI). The results of multiple 
alignment revealed that the SflV bacteriophage had high 
degree of DNA sequence similarity with other serotype 
converting bacteriophages of S, flexneri in particular Sfl, 
Sfll and SfV, but not with SfX and Sf6. Considerable 
similarity was also obtained with two E, coli phages: 
phage phiP27 and phage cdtl, and one Salmonella phage 
ST64B. Rest of the lambdoid phages showed limited 
silmilarity to SflV, with phage Mu showing no similarity 
at all. To get a better understanding of the degree of re- 
latedness among the Shigella phages, bacteriophage SflV 
encoded proteins were compared with those of other 
serotype converting phages of S, flexneri (Figure 3). 

orf-1 and orf-2, encode the phage SflV small and large 
terminase subunits, respectively, which are required for 
packaging bacteriophages dsDNA into the cavity of 
preformed protein shells called proheads [29]. The 
terminases of bacteriophage SflV show high amino acid 
level identity (>90%, Figure 3) with the terminases of 
bacteriophage Sfl, Sfll and SfV, suggesting that the 
packaging mechanism is same in these phages. This was 
verified by the genome sequence analysis of phage SflV 
which revealed a cos site identical to phage Sfl, Sfll and 
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Figure 2 Genetic map of bacteriophage SflV. The SflV phage genome is shown with a scale in bp. The arrows above the scale represent 
predicted open reading frames, with the arrowheads indicating the direction of transcription. Putative function and names of genes are given 
above. The attP/cos site is indicated in red. The Rho-independent terminators are indicated as black knobs below the scale. 
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Figure 3 Comparison of SflV witii other serotype converting phages of S. flexneri. Protein corresponding to eacli orf was compared witli its 
equivalent protein in Sfl, Sfll, SiV, Sf6 and SfX pliages using ClustalW. Tlie arrows below the SflV map are colour coded and represent the amino 
acid sequence identity between the SflV proteins and their equivalents in other phages. The colour codes are:- red: 90-100% identity, yellow: 
80-89% identity, orange: 70-79% identity, green: 60-69% identity, blue: 50-59% identity and grey: 10-50% identity. The purple arrows in the SflV 
map represent the unique proteins of phage SflV. 
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SfV. Thus, like Sfl, Sfll, and SfV, phage SflV uses bac- 
teriophage lamba Uke packaging in which virion ge- 
nomes are generated by cos site specific DNA cleavage 
at the beginning and at the termination of every pack- 
aging event. This results in all the virions with DNA of 
identical, specific ends, generating only precise frag- 
ments on restriction digestion. Furthermore, very lim- 
ited similarity observed between the terminases of SflV, 
Sf6 and SfX could be because of the different packaging 
mechanisms used by Sf6 and SfX. The packaging mech- 
anism of bacteriophage Sf6 and SfX is like phage P22, in 
which the terminases bind to the pac recognition site, 
and the DNA is packaged by the headful mechanism 
[15,29,30]. 

Databases homology searches suggested that orf-3, orf- 
4 and orf-S encode the bacteriophage portal protein, 
prohead protease and major head protein, respectively 
(Additional file 1: Table SI). Protein alignment results 
have shown that these three proteins of SflV share >90% 
identity with their counterparts in phage Sfl and Sfll, 
but very little identity was obtained with those of SfV. 
However, it should be noted that even though the SfV 
proteins show little amino acid similarity, they still be- 
long to the HK97 family of proteins (like Sfl, Sfll and 
SflV). Thus, like several other proteins from tailed bac- 
teriophages, these could be divergent homologues show- 
ing conserved structure and function [31]. In contrary, 
viral capsid assembly proteins in bacteriophage Sf6 and 
SfX are different. The coat protein of Sf6 is similar to that 
of bacteriophage P22 and is known to include surface ex- 
posed domain (a conserved feature specific to P22 like 



bacteriophages) with an "HK97-like" core [32]. In addition, 
Sf6 bacteriophage also contains a separate scaffold protein 
and three other head completion genes [15]. 

Analysis of proteins encoded by orf-6 through orf-21 
suggests that this region of the SflV genome is involved 
in bacteriophage tail structure and assembly (Additional 
file 1: Table SI). The major difference between the tail 
proteins of SflV and other 3 phages (Sfl, Sfll and SfV) 
was ORF-12, ORF-13 and ORF-22. While ORF-12 (a 
hypothetical protein) and ORF-13 (a tail tape measure 
protein) of SflV showed <50% identity with Sfl (ORFs 15 
and 16), Sm (ORFs 15,16), and SfV (ORFs 13,14), ORF- 
22 which is a predicted tail fiber assembly protein was 
completely missing in Sfl, Sfll and SfV. Since phage tail 
fibers bind to the receptors on the bacterial host for ad- 
sorption [33], presence of an additional tail fiber assem- 
bly protein in SflV could explain the fact that the host 
range of SflV does not match with the above mentioned 
three phages. Unlike Sfl, Sfll, SflV, and SfV (which be- 
long to group Al), phage Sf6 and SfX are members of 
class C bacteriophages, consisting of icosahedral head 
and a short tail. Thus, unlike SflV, Sf6 and SfX have only 
one protein in the tail spike module [34]. 

SflV phage orfs 25-27 encode for the O-antigen modi- 
fication cluster, orf'25 corresponds to the gtrlVgene, the 
serotype specific modification gene [8]. orfs 26 and 27, 
encode for the^^rB/^and^^M/vj respectively. As expected, 
no protein level similarity was observed between the type 
specific modification genes of the six bacteriophages. 
However, the gtrA and gtrB genes were highly conserved 
(with >90% protein level identity). The integrase and 
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excisionase proteins encoded by orf'28 and orf'29 of SflV 
were also conserved between Sfl, Sfll, SflV and SfV. How- 
ever, when compared with the Int and Xis of Sf6 and SfX, 
very little identity was observed with proteins of both Sf6 
and SfX phage. Different Int protein of Sf6 could explain 
its integration into agrW, instead of thrW tRNA gene [7], 

orfs 30-34 encode for hypothetical proteins with no 
clear functions. While these proteins were completely 
missing in Sfl, Sf6 and SfX, the same cluster of unknown 
genes was observed in SfV (orfs 28-32) and was found 
to share >98% protein level identity with SflV proteins. 
The homologues of ORFs -30, -31 and -32 were present 
in Sfll as well (ORFs 32, 33 and 34), however, ORF-33 of 
SflV was missing in Sfll, and ORF-34 showed only 85% 
identity with ORF35 of Sfll. 

In all the lambdoid phages, the regulatory switch that 
determines whether the phage will follow a lytic or lyso- 
genic cycle includes an operator/promoter sequence and 
two major proteins similar to the Cro and cl repressor 
proteins of phage lambda. Analysis of the SflV genome 
revealed that the orfs -37 and -38 encode for the cl and 
Cro proteins, which are 100% identical with the cl and 
Cro of SfV. The cl and Cro proteins of Sfl, Sfll, Sf6 and 
SfX, however, showed very limited identity to SflV. Ana- 
lysis of the SflV sequence for additional lambda regula- 
tory elements CII, cIII, or N proteins, revealed that like 
Sfll and SfV, SflV does not contain any additional regu- 
latory factors, although these factors were present in the 
other three phages (Sfl, Sf6 and SfX). 

The blastp search results have also revealed that the 
SflV orf -39 and -40 encode for a putative DNA- 
binding regulatory protein, and an unknown protein, 
which had >90% identity to Sfll ORFs 39 and 40 pro- 
teins. The replication module of SflV which stretches 
from orf-41 to orf-45, showed high identity to the corre- 
sponding proteins of Sfll and SfV, but was not present 
in the Sfl, Sf6 and SfX genome sequences. Intrestingly, 
most of the replication module of SflV was similar to 
Sfll and SfV, however, the methylase gene present in Sfll 
and SfV replication module was not found in SflV. 

The antitermination Q protein encoded by orf-46 reg- 
ulates the expression of lamdoid phage lytic proteins by 
modifying the RNA polymerase near the phage late gene 
promoter allowing it to pass one or more terminators 
[35]. The antitermination Q protein of SflV was very dif- 
ferent from all the other phages, sharing <50% identity 
with its equivalents. The lytic cassette of SflV consists of 
holin, lysin, Rz and Rzl proteins (encoded by orfs 50- 
53). These proteins share high degree of identity with 
their counterparts in Sfll and SfV, but are different to 
their equivalents in Sfl, Sf6 and SfX. Like bacteriophage 
lambda, the lytic cassette of Sf6 and SfX contains holin 
with a dual start motif (resulting in production of holin 
and anti-holin by the same gene) that regulates the 



timing of lysis [36]. However, the mechanism of control- 
ling the timing of lysis in Sfl, Sfll, SflV and SfV is not 
known, orf-54 the last gene of SflV encodes for a HNH 
endonuclease domain protein which had homologues in 
Sfl, Sfll, and SfV, but not in Sf6 and SfX. Given that the 
orf'54 is a member of phage late genes and its homologues 
were only observed in the Shigella phages lacking the dual 
start motif of holin, it could be hypothesized that this pro- 
tein plays a role in controlling the timing of lysis. 

Proteins unique to phage SflV 

There were several proteins in the SflV genome (high- 
lighted in purple. Figure 3), which did not have homo- 
logues in any other serotype converting phages of S, 
flexneri. These are: ORF-22, a tail fiber protein (discussed 
earlier), ORFs -23, -24 identified as insertion elements 
ISEhe3 Orf B, ISEhe3 Orf A, respectively, and ORFs -35 
and -36 proteins with no clear function. These two un- 
known proteins are potential candidates for future viru- 
lence studies as several bacteriophages are known to 
encode factors that alter host properties relevant to vari- 
ous stages of infection [37-39]. The region between 
antitermination Q and lytic cassette {orfs 47-49) was iden- 
tified as a moron. Like a typical moron, this region is 
flanked by transcription initiation and termination signals. 
Moreover, analysis of the GC content has revealed that 
this region has a significantly lower GC content (39.20%) 
than the rest of the SflV phage (50.29%). orf 47 encodes 
for a transposase (ISl Transposase B). Based on this, we 
hypothesize that orf-48 and orf-49, came with orf-47 in 
the SflV genome from another species. As morons are 
expressed in lysogens and are known to confer selective 
advantage to the host [16], it is reasonable to assume 
that the unknown proteins encoded by orfs -48 and -49 
could be involved in the pathogenesis of its host 
bacterium. 

Conclusions 

This is the first report on isolation of a serotype 
converting bacteriophage SflV from a wildtype S. flexneri 
4av strain. The complete genome sequence of phage SflV 
was determined, and the comparison between SflV and 
other S, flexneri serotype converting phages was obtained 
at the protein level. The analysis revealed that the SflV 
phage was more closely related to Sfll and SfV, than the 
rest of the phages. Nevertheless, SflV phage was found to 
contain five proteins which were not present in any other 
phages of S, flexneri and may potentially be involved in 
the pathogenesis of its host. Moreover, the comparison 
study also indicated that the general organization of the 
genes was similar in all the phages. Further molecular bio- 
logical studies based on the comparative genomic analyses 
of these phages will be useful in elucidating the evolution 
of these related phages. It will also provide essential 
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information for the development of new phage vectors 
and therapeutic agents to control infection caused by S. 
flexnerL 
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